1 

51 
101 



GGCGTAATCT GCTGCTTCa AACMAAAM KACCKTAC aaffiTGGT 
TTGTrrGCCG GATCAAGAGC TACCAACTCT TTTrCCGAAG GAACTGGCTT 
101 aGATACCAA ATACTGTCCT Ta» 

isi GCCACacn CAAGAACTCT GTAGCACCGC CTAaTACCT CGCTCTGCTA 
Z aCTGO^GC TGCCAGTOX GATMGIGGT GTCTTACCGG 

251 
301 
351 

401 CiAUMAUU-Vj ^v^vju.r^,.- . ^.^^TTTA -r^rTrrTrT^ 

,51 GCACGAGO^ GCTTCCA^ GGAAACGCCT « A^ 



251 g™ AGACGATAGT TACCGGATAA GCCCaGCGG TC™ 
'gI^^^G 'Z^mC CACG^C^ ACAGGAGAGC 



fGGGGffiTTC GTCaaaC CCaGCTTGG AGCGAACGAC CTACACCGAA 
MC ^GCGTGA GaATGAOAA AGCGCCACGC TTCGCGAAGG 



751 « C^ ttC« aCC^TCACT TTCAAAAGTC CGAAAGAAK 
8^ Sa-TOTCTCTO GAGGTCGCTG AGTAGTGCGC GAGTAMA^ 
Z CAAGGCAAGG CTTGACCGAC MTTGCATGA AGMTCm 



SOI ^i^^c AcacTGAa tga™ tttttgtgat gc™ 

Isol MCG «C ATM CGmTGGCA 



551 GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCCGAATTa CC^XTCTT 
™m ATGACOTAT GGGACTITCC 



SOI TC=AAa™CC™CA« 

'''' lc™;=cr«^ 



651 «AGG CCAGTMAAG CATTACCCGT CGmC 



,01 ^rATG^-C^aaCTAAAT^^^^^ 
'''' ra» IS^^TAG GGACmca TO^^^ 



901 ImmGG CGTTITGCGC TGCTTCajGA TMC A» 
=^0^ CS^CG^CCG CM 



951 GmCATO ATTATTGACT AGnAKAAT ACTM TCM TATO 
HAGTOTA GCCCATATAT GGAGTrCCGC GTTAOTMC TTAOTTAM 



1351 » »T AM ^f^^^^^ 
1401 TCCACCCCAT TGACGTCAAT GGGAGmGT TOa M^TCACffi 

u^l GAcrrrccAA aatgtcgtaa caactccxc ccattgacgc aaatgggcg. 

m™ COXACTGGG GAGTGGCGAG CCaCAGATG CTO^TATAA 

° Sot tt^ctgta aGGGTcia ac<:mGAC cAGATacAG 
fs^ ™^ ?a«rrAA ctagagaacc cAaGCTTM ™ 

1651 AGOTCTAGA GATCCaCGA CCTCGAGGGA TCnCOTAC aACCAGlTC 

FIG. 1A 



T^rrrrro-i P^TCGCGGCC GCGAaCTAG AGGATCTTTG TGAAGGMCC 
™™ ffiWT AArrffiACM ACTACCTACA GAGAmAAA 
^Sm nmAAGTG TATAATGTGT TAAACTACTG 

fS. mlimn mmca acctatggaa ctgatgaatg 

'''' ^J^™ SgCCT ™TGAGGA AAACCTGHT TGaaGAAG 

s ===== 

™S mSmAG aGCAaGCT ATACAAGAAA 

2101 ccrrrcTAT ™caccaw Zccmm agtaggcata AaGiiATAA 

2151 AmTGGM^ AATAnOGT MCCmA ^^^^^^^^^ 
™S TTAGCnm AATHGTAAA 

^S'S^ GATGTATAGT GCCITGACTA GAGATCATM 
tSaTAC CCATTTGTA GAGGTriTAC TTGCnTAAA AMCCTCCCA 

2901 TGGAGGCCTA GGCmTGCA AAMGCTaaT TC 



2201 
2251 
2301 
2351 
2401 



FIG. 1B 

80 THRASPASPbLNA. ^ .......TrTrAiiArrAAAnAtCTCCTGGACT (420) 



CYSlLEASNTHRTHRLEUtJiRLYSbLUVAUYii ^ 

200 ^[l^'z^^^^ (780) 



120 
140 
160 



220 
240 
250 
280 




FIG. 2A 



TmCMTAAAMMG(^™Ca^^^^ 
A(^TGmCTGTGTGCAGMCAnGTaCCTCCTm^ 
GaTCmGAACTCAGCCmGGTCMCATCTGGAG^^^ 
CATCAaCCAGTAAGGAGAAGCAATATAAGTGTGATTGCAAM^ 

ACAGAAATCtTAGAGAmcmCCCCTCTCAGGT»^^ 

TGAnGGTGtGCCTGGGTCtCACTACMGCAGCCTAOT 

cmTGTGCCCTGGTGGAdcnGCCCACCATCCTGTGAGTAAAAG^ 

TGAC (1504) 



(1020) 

(1080) 

(1140) 

(1200) j 

(1260)/ 

(1320) 

(1380) 

(1440) 

(1500) 



FIG. 2B 



1 GGCGTMTCT GCTGCTTCa MCAAAAAAA CCACCGCTAC CAGCGGTGGT 
51 TTGITTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG GTAACTGGCT 
101 TaGCAGAGC GCAGATACCA AATACTGTCC TTCTAGTGTA GCCGTAGm 
151 GGCCACCACT TCAAGAACTC TGTAGCACCG caACATACC TCGCTCTCa 
201 aatcctg™ CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG 
251 GGUGGACTC AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA 
301 ACGGGGGGK CGTGCACAa GCCaGCTTG GAGCGAACGA CCTAaCCGA 
351 AaCAGATAC CTACAGCGTG AGCATOGA AAGCGCCACG CITCCCGAAG 
401 GGAGAAAGGC GGAaGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG 
451 CGCACGAGGG AGCTOAGG ffiGAAACGCC TGGTATCm ATAGTCaGT 
501 CGGGtnCGC aCCTCTGAC TTGAGCGTCG A-^GTGA TGCTCGTCAG 
551 GGGGGCGGAG CCTATGGAAA AACGCaGO ACGCAAGCTA GCTCTAGCT 
601 AGAAATTGTA AACGHAATA imG^AAA ATCGCGm AAnTTTGH 
651 AAA-,CAGCTC AnWAAC CAATAOXCG AAATCGGCAA AATCCCmT 
701 AAATCAAAAG AATAGCCCGA GATAGGGTTG AGTGTTGITC aGTUGGAA 
751 CAAGAGTCCA CTATTAAAGA ACGTGO^CTC CAACGTCAAA GGGCGAAAAA 
801 CCGTCTATa GGGCGATOX CGCCCACTAC GTGAACCATC ACCCAAATa 
851 AG^GG GGTCGAGGTG CCGTAAAGa CTAAATCGGA ACCCTAAAGG 
901 GAGCCCCCGA mAGAGCH GACGGGGAAA GCCGGCGAAC GTGGCGAGAA
951 AGGAAGGGAA GAAAGCGAAA GGA(XGGGCG CTAGGGCGCT GGCAAGTGTA
1001 GCGGTCACGC TGCGCGTAAC CACCACACCC GCCGCGOTA ATGCGCCGCT 
1051 AaGGGCGCG TAaATGGTT (XTTTGACGA GCACGTATAA CGTGCTITCC 

FIG. 6A 



1101 TCCnGGMT CAGAGCGGGA GCTAMCAffi AGGCCGAm-AAGGGAmT 
1151 AGAaCGMC GGTACGCaG CTGGATCACC GCffiTCmC TCAACGTAAC 

1201 Acmao: oxW hgatatgat gcgccccgct tcccgataag 

1251 GGAOaGGCC AGTAAAAGa mCCCGTGG TGOGTrCCC GAGCGGCCAA 
1301 AGfflAGCAGA CTCTAAATCT GCCGTCATCG AGTOGAAGG HCGAATCCT 
1351 TCCCCaCCA CCAT«m CAAAAGTCCG AAAGAAKTG CTCCCTGCTT 
1401 GTGTGTIGGA GGTCGCTGAG TAGTdGCGA GTAAAATTTA AGCTACAACA 
1451 AGGCAAGGCT TGACCGACAA TTM AATCTGCTTA GGGHAGGGG
1501 WGCGCTG OTCGCGATG TACXGCCAG ATATACGCGT TGACAITGAT 
1551 TAlTGAaAG m^TAG TAATCAATTA CGffiGTGAn AGTrCATAGC 
1601 CCATATATffi AGTTCCGCGT TAaTAACn ACffiTAAATG GCCCKCTGG
1651 CTGACCaCC AACGACCGCC KCCAnGAC GTCAATAATG ACGTATGTC 
1701 CaTAGTAAG GCCAATAGGG ACTTTCCATT GACGTCAATG GGTGGACTAT 
1751 mCGGTAAA- CTGCCCAOT OXAGTACAT GAAGTGTATC ATATOGAAG 
1801 TACGCGCCCT AHGAGGTa ATGACGGTAA ATGO:CCGCC TGGanATG 
1851 CCaGTACAT GACCTOIGG GACmCCTA cnGGaGTA GATCTACGTA 
1901 TOGiaTCG aATOCCAT GGTGATGCGG TnTGGGAGT AOTCAATGG
1951 GCGTCmAG CGOmGAa CACGGGGA^ TCGAAGTaC CACCCCATTG
2001 ACGTCAATGG GAGTTTGm TGGaCCAAA ATCAACGGGA CTTTCCAAM 
2051 TGTCGTAACA ACTCCGCCCC ATTGACGCAA ATGGGCGGAA TCaGGOT 
2101 GGACTGGGGA GTGGCGAGCC CTCAGATGGT GaTATAAGC AGaGC^ 
2151 TGCGTGTACT GGGiaCTa GGTTAGACCA GATCTGAGCC T03GAGCTCT 
2201 CTGGaAACT AGAGAACGCA CTKTTAAGC aCAATAAAG CnGTAGAGA 
2251 TCCaCGACC TCGAGATCa TOTGCTGGC GCffiATTCH TATaaGAT 

FIG. 6B 
2301 MGTTGGTGG ACATATTATG TTTATCAGTG ATAAAGTGTC AAGCATGACA

2351 AAGTTGCAGC CGAATACAGT GATCCGTGCC GCCCTAGACC TGHGAACGA 

2401 GGTCGGCGTA GACGGTCTGA CGACACGCAA ACTGGCGGAA CGGHGGGGG 

2451 naGCAGCC GGCGCTTTAC TGGCACTTCA GGAACAAGCG GGCGCTGCTC 

2501 GACGCACTGG CCGAAGCCAT GCTGGCGGAG AATCATAGa CTTCGGTGCC 

2551 GAGAGCCGAC GACGACTGGC GCTCATTTCT GACTGGGAAT GCCCGCAGCT 

2601 TCAGGCAGGC GCTGCTCGCC TACCGCCAGC ACAATGGATC TCGAGGGATC

2651 nCCATACCT ACCAGTTCTG CGCCTGCAGG TCGCGGCCGC GACTCTAGAG

2701 GATCITIGTG AAGGAACCTT ACTTCTGTGG TGTGACATAA HGGACAAAC 

2751 TACCTACAGA GATTTAAAGC TCTAAGGTAA ATATAAAATT TTTAAGTGTA 

2801 TAATGTGHA AACTACTGAT TCTAATTGTT TGTGTATTTT AGATTCCAAC 

2851 CTATGGAACT-aTGAATGGG AGCAGTGGTG GAATGCCTTT MTGAGGAAA 

2901 ACCTGTTnG CTCAGAAGAA ATGCCATCTA GTG^TGATGA GGCTACTGCT 

2951 GACTCTCAAC AHCTACTCC TCCAAAAAAG AAGAGAAAGG TAGAAGACCC 

3001 CAAGGACTTT CCTTCAGAAT TGCTAAGTTT TTTGAGTCAT GCTGTGITTA 

3051 GTAATAGAAC TCnGCTTGC TTTGCTATTT ACACCACAAA GGAAAAAGCT 

3101 GCACTGCTAT ACAAGAAAAT TATGGAAAAA TAHCIGTAA CCTTTATAAG 

3151 TAGGCATAAC AGTTATAATC ATAACATACT GTTTTTTCTT ACTCCACAGA 

3201 GGQTAGAGT GTGTGCTATT AATAACTATG CTCAAAAATT GTGTACCTTT 

3251 AGCTTTITM TITGTAAAGG GGTTAATMG GAATAHTGA TGTATAGTGC 

3301 CTTGACTAGA GATCATAATC AGCCATACCA CATTTGTAGA GGTTTTACTT 

3351 GCTTTAAAAA ACCTCCCACA CCTCCCCCTG AACCTGAAAC ATAAAATGAA 

3401 TGCAAHGIT GTTGTTMCT TGlTTAnGC AGCTTATAAT GGTTACAAAT 

3451 AAAGCAATAG aTCACAAAT HCACAAATA AAGCATTTTT HCACTGCAT 

FIG. 6C 



3501 TCTAGFGTG GTlTGTCCM ACTOTCMT GTATCTTATC ATGTCTGGAT 
3551 CaGTGGAAT GTGTGTCAGT TAffiGTGTGG AAAGTCCCa GGCTCCCaG 
3601 aGGaCAAG TATGCAAAGC ATGaTCTa AnAGTCAGC AACaGGTGT 
3651 GGAAAGTCCC CAGGaCCCC AGCAGGCAGA AGTATGCAAA GCATCaTCT 
3701 CAATOGTCA GCAACCATAG TCCCaCCCT AACTCCGCCC ATCCCGCCCC 
3751 TAACTCCGCC CAGTOCGCC CATOTCCO: CCCATGGaG ACTAATTTTT 
3801 TTTATTTATG CAGAGGCCGA GGCCGCCTCG GCCTCTGAGC TATTCCAGAA 
3851 GTAGTGAGGA GGCTTimG GAGGCCTAGG CmTGCAAA AAGCTAATC 

FIG. 6D 
AGACTCTaffiCCTTOKAffiTGCGTCTTtaGrrCCCCTmCTTCGamaCGGG (50)
GAGGAGGGGCTGGMCCCTAGCCaraTCAGGACAMGATOO^^ (120) 



10 
30 
50 



170 
190 



-18 

rrri irrriTTrrrTTrAAtTCAAGTAACAGGAAACAAGATTTTGGTGAAGaGTCGCCC (180) 

E^SSS^^via^RGLYASflYSlL^V^LYSGU^RO 




HlsSeRASPTYRf/ETASN^tTTHRPROARGARG-KU^LTrKU i nnnnuL. , o..^^ , n ^ 

CCCTAT(XCGCACCAC(rGAClTCGaGCCTATCqc™^^ 
PROT?XpROPROARGASPPh£ALMLATYRAR^ 

AGCCAGCCmGCaGCCCCCATCTGCTCAATATa^^^ 

CCATCTCCAGCCGGCCACCtCA(rCCCTGn(mCACCM^^^ 

ACTAGACCAAATATCAAGAtaTmCAGACTCTGAAATGAAGTAAA^^ 

GACAGGC(:AAGTC™CAGtGCCATGGCCCACAmCAACmCO\TGT^^^^ 

TGACTGAGMGTOGGGTAQVlAACAAAAAGGGAGTGGAnCTGG^^ 

FIG. 7A 



(240) 
(300)
(360) 
(420) 



70 GLYLYSLEUGLYAS^X]LUSeRVALlHR^^lYRLtU.UNn:..NL^u, rr.v«^j^j^^^. 

,30 ^^^m^M^^^^SM '""^ 

TrGCTAGTAA(J\GTGGCC 
150 LbXBJValThrVai ' 



(720) 

(780) 

(840) 

(900) 

(960)

(1020) 

(1080) 



(1140) 
(1200) 



CTaCTCACCT(^CATCTCAGT(:M(K:AMGTGT(^W^^ 
GMGAMGGCTAGGAMTaiTCCmGG^AMTGGGTGnTMTCm^^ 
GG^AAACGGGGTAAGteGTAGGGGGAGGGATAGGAAGACIATAm (1250) 
AAAaCTGTCTCCaCTCAtGAAATGAGCCACGTAGmCT^^^^ 
TAGmAGAAATACATAGACATTGTCrmATGAAnCTGATCAT^^^^ 
CCAAATGAGGGAmGGT(:AAATGAGGGAnCCCTCAAAGC^^^^ 
TGCmCCTCACTCCCTGTCATGAGACTTCAGTGTTAATGn^^^ 

GAATAAAATAGTTC (1514) 



(1320) 
(1380) 
(1440) 
(1500) 



FIG. 7B 



TAC«CCaGAGAffiaCAO:TGaCTCQCCCm(a^GAmGTffi^^^ 




GLYALALBJALA/^Ij1-N / 

TCTCCCTGCCCOTGAGaCAGAGCCCCCAGTCTGAGCCAaaACAGCCCm^^ 

GGGTGGGGTGCTGACCOXCTCCAGTGTCTGAGGACTCAmAAGAGAAGGWAAGGGT 

GGACCCGGTGGCGAGTGGCCGGGCaGTCCAGGaGGGCCKTGCmGffiAGGAAGAAG 

CCaCAGTaCGGAAOtGAGGACAGCAGCTCCCCCAAaCaCAGCCGGTGCCCAGATC 

TGCTCCATGCCCCGTAAGGCACCGTGTCmGGCGAaTGTCAGCCCTGGGaGTCTCAG 

GGCCCaCCATCCCCACCACTGTCCCCTGCAGGGAGGACAnCTCTGTCCTTCTGGCCAG 

ACTGATGGTGACAGCCaffiTCGTCCCA«GGT^ara^^ 



(60)

(120) 

(180) 

(240) 
(300) 
(360)
(420) 
(480) 
(540) 
(600) 




FIG. 8A 



(1140) 
(1200) 
(1260) 



GTACaGTGACCaGT&aCCCTOIACGTCCCGCCTGraTCCCCC^^^ 
nTyrGlnEnd 

GCCCCACCAt(XCCCC(:ACCCT(X(:ACAa^^^ 
CAGAGmGAAGGGCC(:AGCCCTGCCCAGCTC(jUGCA^^^ 
CCCACGGTGCnCTCAGTGGACAATGATGCCTCCTCCGGGAA^^^^ 
ACGCCGCCACCGGGAGGAAGCCTGACTGTCCTnGGCTG6\TCTCCC(lAC^^^ 
GAGGGCrmCTGTGGGATGGGCCTGGCACGCGGCCCTCtCCTGT^^^^ (1560) 
CACCAGCAGGCCCCCAACCCCCAGGaGCCCGGCIAGAGGACGGGAGGAGA^^^^ (1620) 
ACCCAGCCGtACCAGAAATMAGGCTTCTGTGCTTCAAAAAAAAA (1665) 



(1320) 
(1380) 
(1440) 
(1500) 



FIG. 8B 
.■ ^ ^..^..,r.\rrrr^ATrJr^TTCCCACCt (240)




U^ARGS£RPROGL^RASPSeRiLEbLNlKKr^^x^non.^ 

:...^....^rTrTTrTriTrrAArrrTGTGACCATCACTG (500; 
230
250 
(1020)
(1080)
(1140) 
(1200) 
(1260) 
(1320) 
(1380) 



TrArTCTOCTCCCMraC(:ATGTO(:AGTMTMCIAM(»GTMCG™TGCOT^ 

TGGTaTACTCTaGCTTGCTGAGTGGATGACMMAGAGGGGAAnGTTAMGGAAMT 

TTAMTGGA(»aGGAAAMTCCTGAGCAMCAAMCCACCTGGCCCrrAGAMTAa^ 
TAA(jrrGCTTAAACTACAAACA(:MGCAAAACTTCACGGGGraTACTAaTACAAGa 
TAAGCAAAACnAACTOGATamCTGGTAAATGOTATGTOGWATAAGACAACCC 
CAGCCMTCACAAGCAOCCTACTAAaTATAATOGGTGACTAGGGACmCTAAGAAGA 
TACCTACCCCCAAAAAACAATTATGTAATtGAAAACCAACCGAnGCCTrrATmGCTT 
CCACAmrCCCAATAMTACTTGCCTGTGACAmGCCACTGGAACACTAAACnCAT 
GAAnGCGCCTaGATmCCrrrAAaTCr^mTamGAaGAGTCTCAATOG (1440 
™cCAGGCTGGAGTG<:AmTATCTTGGCTaCTGCAAAGCCGCaCCCA«3Tr 1^ 
TAAGCGAnaaTGCCTCAGCaCCaGtAGCTGGGA-rtAGAGGaTGK^aTCATAC 1560 
CCAGCTAAmTGTAIITmAWarmTAGTAGAGACAGGGmcarAATGTT 

G<XCAGGCCGATCTCGAACTtCTG(XCTCTAGCGATCTGCCCGCCTC^^^^^ 
GCTGGGATGACaGaTaGCCCCAATGTCaGCCTCmAAaTCTOmCCTATGCC 

CTCTCTGTGGATCCCTACTGCTGGmCTGCCrrCTCaTGCT(lAGAACAAAATCACCTA 

rraCTGCrtATGCAGTCGGAAGCTCCAGAAGAACAAAGAGCCCAATOCCAGAACaa 

TTAAGTaCCATOimGCCTTGGGArrtGAGAAMAGAGAGGTGAGGATaGG 

TAmCCTGGAaAAATrCCCCnGGGGAAGACGAAGGGATGCTGCAGTOCAAAAGAGA 

AGGACTOTMCATCTACCTGAGTCCCAAAGacaTGTCCTGAAAGCaaGAC 

AATAmCCCAAATGAaGACTGaCCnaGTOGCCGmOTCAAGAA 

T(:^CTGlTCCA<:ATC(:A(:AaGCCAATACAATOGTCAMCaCTGTTA™CAGATG 

TAGCAACATGAGAAACKTtATGTTACAGGraCATGAGAGCAATaTGTAAGTCTATAT 

GACnCAGAMTGTTAAAATAGACTAACCTCTAACAACAAAFAAAAGTGAnGmCAA 

GGTGAAAAM (2290) 

FIG. 9-B 
1 . .GGAGAGTC TGACCACCAT GCCACCTCCT CGCCTCCTCT TOTCCTCCT

51 CTTCCTCACC CCCATGGAAG TCAGGCCCGA GGMCCTCTA GTGGTGAAGG 

101 TGGAAGAGGG AGATMCGCT GTGCTGaCT GCCTCAAGGG GACCTCAGAT 

151 GGCCCCACTC AGCAGCTGAC CTGGTaCGG GAGTCCCCGC TTAAACCCTT 

201 CTTAAAACTC AGCCTGGGGC TGCaGGCCT GGGAATCCAC ATGAGGCCCC 

251 TGGCCATCTG GCtnTCATC TTCAACGTCT CTCAACAGAT GGGGGGCTTC 

301 TACCTGTGCC AaCGGGGCC CCCCTaGAG AAGGCCTGGC AGCCTGGCTG

351 GAMTCAAT GTGGAGGGa GCGGGGAGCT GTTCCGGTGG AATGTTTCGG 

401 ACCTAGGTGG CCTGGGCTGT CaCTGAAGA ACAGGTCCTC AGAGGGCCCC 

451 AGCTCCCCTT CCGGGAAGCT OTGAGCCCC AAGCTGTATG TGTGGGCCAA 

501 AGACCGCCCT GAGATCTGGG AGGGAGAGCC TCCGTGTGTC CCACCGAGGG 

551 AaGCCTGAA CCAGAGCCTC AGCCAGGACC TaCaTGGC CCCTGGCTCC 

601 AaCTCTGGC TGTCCTGTGG GGTACCCCCT GACTCTGTGT CCAGGGGCCC 

651 CCTCTCCTGG ACCCATGTGC ACCCCAAGGG GCCTAAGTCA TTGaGAGCC 

701 TAGAGCTGAA GGACGATCGC CCGGCaGAG ATATGTGGGT AATGGAGACG 

751 GGTCTGTTGT TGCCCCGGGC CACAGCTCAA GACGCTGGAA AGTAmiTG 

801 TCACCGTGGC AACCTGACtt TGTCATTCa CaCGAGATC ACTGaCGGC 

851 CAGTACTATG GCACTGGCTG CTGAGGACTG GTGGCTGGAA GGTCTCAGCT 

901 GTGACmGG CTOTaGAT CTTCiaCTG TGTCCCTTG TGGGCATCT 

951 TCATCTTCAA AGAGCCCTGG TCCTGAGGAG GAAAAGAAAG CGAATGACTG 

1001 ACCCCACaG gagattcttc aaagtgacgc ctcccccagg aagcgggccc

1051 aCAACaGT ACGC-Q\ACGT GCTGTCTCTC CCaCACCCA cctcaggcct 

1101 CGGACGCGCC aGCGTTGGG CCGCAGGCGT GGGGGCaCT GCCCCGTCTT 
1151 ATGGAAACCC GAGCAGCGAC GTCCAGGCGG ATGGAGCCTT GGGGTCCCGG 

FIG. 12-A 



1201 AGCCGCCGGG AGTGGGCCCA 

1251 CCTGACAGTG AGGAGGACTC 

1301 GCAGGACCAG CTCTCCCAGG 

1351 AGCCCCTGGG TCCTGAGGAT 

1401 GAGAACGAGG ATa^AGAGCT 

1451 GCTGAGCCCT aiGGGTCAG 

1501 TGGGGTCCCA GTCCTATGAG 

1551 CAGCTCCGCT-GCAnCGGGG 

1601 AGACTCITAT GAGAACATGG 

1651 GAGGAGGGGG CCGCATGGGC

1701 CAGCCTGGAT CTCCTCAAGT 

1751 GAAGACCTCG AGCAGATGAT 

1801 GTGTG-CATGT GTGTAAGTGT 

1851 ATACATGCCA GTGACACTTC 

1901 AATGAGCTCT TCCAAAAAM 



GAAGAAGAGG AAGGGGAGGG CTATGAGGAA 
CGAGHCTAT GAGAACGACT CCAACCTTGG 
ATGGCAGCGG CTACGAGAAC CCTGAGGATG 
GAAGACTCCT TCTCCAACGC TGAGTCTTAT 
GACCCAGCCG GTCGCCAGGA CAATGGACTT 
CCTGGGACCC CAGCCGGGAA GCAAGCTCCC 
GATATGAGAG GAATCCTGTA TGCAGCCCCC 
CCAGCCTGGA CCCAATCATG AGGAAGATGC 
ATAATCCCGA TGGGCCAGAC CCAGCCTGGG 
ACaGGAGGA CCAGGTGATC CTCAGGTGGC 
CCCCAAGAn CACACCTGAC TGTGAAATCT 
GCCAACCTCT GGAGCAATGT TGCHAGGAT 
GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT 
CAGTCCCCTT TGTATTCCTT AAATAAACTC 
AAAA 



FIG.12-B 



1 ACAAAGACAA ACTCaCCa CTGAACTCCG CAGCTAGCAT CCAMTCAGC 
51 CCTTGAGATT TGAGGCCTTG GAGACTaGG AGTITTGAGA GCAAAATGAC 
101 AAaCeaCA AAiraGTAA ATGOACm CCCGGCAGAG CCAATGAAAG
151 GCCCTATO TATGCAATCT GGTCCAAAAC CACTCTTCAG GAGGATGTCT 
201 TCACTGGTGG GCCCaCGCA AAGCTOTTC ATGAGGGAAT CTAAGACm 
251 GGGGGCTGTC CAGATTATGA ATGGGCTCTT CCACAITGCC CTGGGGGGTC 
301 mCATGAT CCaGCAGGG ATCTATCaC CCATaGTGT GACTGTGTGG 
351 TACCCTCTCT GGGGAGGCAT TATGIATAH AmCCGGAT aCTCCTGGC 
401 AGCAACGGAG AAAAACTCa GGAAGTCm GGTCAAAGGA AAAATGATAA 
451 TGAAnCATT GAGCacm CTGGAATGAT TCmCAATC 

501 ATGGACATACnAATAlTAA AAITTCCCAT miTAAAAA TGGAGAGTCT 
551 GAAmTAn AGAGCTCACA CACaTATAT TAACATATAC AACTGTGAAC
601 aCCTAATCC CTCTGAGAAA AACTCCCCAT CTACCCAATA CTGTTACAGC 
651 ATACAATCTC TGTTCnGGG armCTCA GTGATGCTGA TCTnGCCU 
701 CnCCAGGAA CTOTAATAG CTGGCATCGT TGAGAATGAA TGGAAAAGAA 
751 CGTGCTCCAG ACCCAAATCT AACATAGTTC TCCTGTCAGC ACAAGAAAAA 
801 AAAGAACAGA CTAHGAAAT AAAAGAAGAA GraiTGGGC TAACTGAAAC 
851 ATCmCCAA CCAAAGAATG AAGAAGACAT TGAAAnATT CCAATCCAAG 
901 AAGAGGAAGA AGAAGAAACA GAGACGAACT TTCCAGAACC TCCCGAAGAT
951 aCGAATCCT aCCAATAGA AAATGAaGC TCTCC™g TGATTTCnC

1001 TGmcTGT nccmm aaacahagt gtotagct tccaagagac 

1051 ATGCTGACn lamCKG AGGTACTCTG CAOTACGa CCACATCTCT 

FIG. 13-A 



1101 ATCTGGCCTT TGCATGGAGT GACCATAGCT CCTTCTCTCT TACATTGAAT 

1151 GTAGAGAATG TAGCCAHGT AGCAGCHGT GHGTCACGC nCTTCTTTT 

1201 GAGCAACrrr CTTACAaCA AGAAAGGCAG AATGAGTGCT TCAGAATGTG 

1251 ATHCCTAa AACCTGTTCC TTGGATAGGC TTTTTAGTAT AGTATTnTT 

1301 TTTGTCAITT TCTCCATCAG CAACCAGGGA GACTGCACCT GATGGAAAAG 

1351 ATATATGACT GCTTCATGAC AHCCTAAAC TATCTTTTTT HATTCCACA 

1401 TCTACGTirr TGGTGGAGTC CCmTTATC ATCCTTAAAA CAATGATGa 

1451 AAAGGGCTTT AGAGCACAAT GGATCT 



FIG.13-B 






1 CCCAMTGTC TCAGAATGTA TGTCCCAGM ACCTGTGGCT GCTTCAACCA 

51 TTGACAGITT TGCTGaCCT GGCTTCTGCA GACAGTCAAG CTGCAGCTCC 

101 CCCAAAGGCT GTGCTGAAAC TTGAGCCCCC GTGGATCAAC GTGCTCCAGG 

151 AGGACTCTGT GACTCTGACA TGCCAGGGGG CTCGCAGCCC TGAGAGCGAC 

201 TCCATTCAGT GGTTCCACAA TGGGAATCTC ATTCCCACCC ACACGCAGCC 

251 CAGCTACAC-G TTCAAGGCCA ACAACAATGA CAC-CGGGGAG TACACGTGCC 

301 AGACTGGCCA GACCAGCCTC AGCGACCCTG TGCAlCTGAC TGIGCTITCC 

351 GAATGGCTGG TGCTCCAGAC CCCTCACCTG GAGTTCCAGG AGGGAGAAAC 

401 CATCATGCTG AGGTGCCACA GCTGGAAGGA CAAGCCTCTG GTCAAGGTCA 

451 CATTCnCCA GAATGGAAAA TCCCAGAAAT TCTCCCGTTT GGATCCCACC 

501 nCTCCATCC CACAAGCAAA CCACAGTCAC AGTGGTGATT ACCACTGCAC 

551 AGGAAACATA GGCTACACGC TGHCTCATC CAAGCCTGTG ACCATCACTG 

601 TCCAAGTGCC CAGCATGGGC AGCTCTTCAC CAATC-GGGAT CATTGTGGCT 

651 GIGGfCAHG CGAQGCIGT AGCAGCCATT GITGCTGCTG TAGTGGCCn

701 GATCTACTGC AGGAAAAAGC GGATTTCAGC CAATTCCACT GATCCTGTGA 

751 AGGCTC-CCCA ATITGAGCCA CCTGGACGTC AAATGATTGC CATCAGAAAG 

801 AGACAACTTG AAGAAACCAA CAATGACTAT GAAACAGCTG ACGGCGGCTA 

851 CATGACTCTG AACCCCAGGG CACCTACTGA CGATGATAAA AACATCTACC 

901 TGACTCTTCC TCCCAACGAC CATGTCAACA GTAATAACTA MGAGTAACG 

951 TTATGCCATG TGGTCATACT CTCAGCTTGC TGAGTGGATG ACAAAAAGAG 

1001 GGGAAITGTT AAAGGAAAAT TTAAATGGAG ACTGGMAAA TCCTGAGCAA 

1051 ACAAAACCAC CTGGCCCTTA GAAATAGCTT TAACTTTGCT TAAACTACAA 

1101 AOlCAAGCAA AACTTCACGG GGTCATACTA CATACAAGCA TAAGCAAAAC 

1151 TTAACTTGGA TCAITTCTGG TAAATGC1TA TGTTAG^AAT AAGACAACCC 

1201 CAGCCAATCA CAAGCAGCCT ACTAACATAT AAHAGGTGA CTAGGGACTT 

1251 TCTAAC^AGA TACCTACCCC CAAAAAACAA 1TATGTAATT GAAAACCAAC 

1301 CGATTGCCTT TATTTTGCTT CCACATTTTC CCAATAAATA CTTGCCTGTG 

1351 ACATTTTGCC ACTGGAACAC TAAACITCAT GAATTGCGCC TCAGATTnT 

1401 CCTTTAACAT CI MIIHM TTTGACAGAG TCTCAATCTG TTACCCAGGC 

1451 TGGAGTGCAG TGGTGaATC ITGGCTCACT GCAAACCCGC CTCCCAGGTT 

1501 TAAGCGATTC TCATGCCTCA GCCTCCCAGT AGCTGC-GATT AGAGGCATGT 

1551 GCCATCATAC CCAGCTAATT TTTGTAITTT TTATnTTTT TTTTTAGTAG 

1601 AGACAGGGTT TCGCAATGIT GGCCAGGCCG ATCTCGAACT TCTGGCCTCT

1651 AGCGATCTGC CCGCCTCGGC CTCCCAAAGT GCTGGGATGA CCAGCATCAG

FIG. 14-A 



1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 



CCCCAATGTC CAGCCTCTTT MCATCTTCT HCCTATGCC CTCTCTGTGG 
ATrrfricTG OGGTHCTG CCTTCTCCAT GCTGAGAACA AAATCACCTA 
ScGG AAGCTCaGA AGAACAAAGA GCCCAATTAC 

SSa HAAGTacc ATTGmr J cn™ OA™ 

TTAGAGAGGT GAGGMCTGG TAinCCTGG ACTAAATTCC CCTTGGGGAA 
ICaGCAGlT CCAAAACAGA AGCACTCTTC CAGM 
TLrnrtcrc CCAAAGCTCC CTGTCCTGAA AGCaCAGAC AATATffiTCC 
ScTC CTGTGCCTCA GCCGnCTTG ACATCAAGAA 

mm S^acA acccAATAC aattagtcaa ACCAac™ 

^Sg « AGAAACGCn » TTA™ 
GCAATaTGT AAGTaATAT GACTTCAGAA ATGTTAAAAT AGACTAACCT 



2251 CTAACAACAA ATTAAAAGTG ATTGTTTCAA GGTGAAAAAA 

FIG. 14-B 



1 

51 
101 
151 



GCTGTGACTG CTGTGCTCTG GGCGCCACTC GCTCCAGGGA GTGATGGGAA 
TCCTGTCATT cmCCTGTC CTTGCCACTG AGAGTGACTG GGCTGAcTgC 
AAGTCCCCCC AGCCTTGGGG TCATATGCTT CTGTGGACAG CTGTGCTATC 
CCTGGCTCCT GTTGCTGGGA CACCTGCAGC TCCCCCAAAG GCTGTGCTGA 



201 AACTCGAGCC ccagtggatc mcgtgctcc aggaggactc tgtgactctg 

m CCMCMCAA tgacagcggg gagtacacgt gccagactgg ccagaccagc 
401 ctcagcgacc ctgtgcatct gactgtgc it tctggtcagt ggaggaaggc 



251 acatgccggg ggactcacag ccctgagagc gactccattc agtggttcca 

CAATGGGAAT CTCATTCCCA CCCACACCa gcccacctac aggttcaagg 

v^/^ivj^ ^^^.^.^-T^^^ rrxnKcrsrr 



551 AGTOCCTa CaCATATa GTGGTOTTT TTGCCTCAGT TCTGATTGM 
CCCTGCCCCC GCCCCGCCa CCATGCCCCT TClCCACG^ CTaCTGTGC 



601 CAGAAGAAGG mCAAG(XC AAAAAOCff AGC» «GAG^^ 
«C »C SaGAm laGGTGAGA ATCAGTGTT 



651 GAAGGAAATC CCTAaGCAT AAAACCCA^ TCCAmTM TCaAWTT 
701 GAAAAGaa GACCACAAa GAATCaAGC CaGGAMTG ACTCACTA™ 
CAAOTGATG AAITCAITTA ACCCTTGAGT TrCCATTTCT TCACCTGCTC 



1501 
1551 



451 CCCAGGGTGG ACaCGGAGG GCaGGACGG ATGMATCTG cmCAGGCA 

901 lAbbl^i^i iv^iv^'^'^*' ^^.-r^ -roAAArrrrr 

951 ATGAAAGTCG TGTCraCCC ATCTrTGTAT TTCCAGCATC TGMA TGffl 

1001 CAGAGCTTAA TAAATAim GCTGGAGAGG TOTGATCT lACMAGCT

1051 CanGAAAG GTGGCTaCT GTAAAOAM GTTACAATCA GATTGTGATG 

ccTOC '^'c^na Tccmcac crcGAaiGG aagagatggc 

i4oi mr acOT mcacGAG aggcctgtcg 



501 GAGGmGa GGAAAGGGffi GTGGCCTGCT TACTGGGMG TATCGCTGTG 

™a TOOT OTac accCTGra gmacctgag 



1101 AACATTGTCC ITGTGGCTIT TCACnAGTC CCCTCCCITC ACCTGAAGAG 
n ^^GC TGAAAAGTAC AUGCAAACG AATGACCCAC tggtgacact 

^201 GHGCcrrrA gaccctgctg gaaagaagq ccacatot taacattccc 

1251 GAAGTAAAK TAiaGGTAG CA™G GTAACAmG nGCACAfrC 
ATGACrmC TACTGTCaC AAAGGaTAT GTCCTOTa TATOT 



FIG. 15-A 



GGGTCTCAGA caGAGCCM GACCTCCCGG GTCCTCTGCG GTTTTTTGTG 

^^.^r-rr^^An uccAGGAGG 

GCCTCTGGTC 
CCCGTTCGGA 



1701 


GGGTCTCAGA 


1751 


TCITTCAGAG 


1801 


GAGAAACCAT 


1851 


AAGGTCACAT 


1901 


TCCCAACTTC 


1951 


ACTGCACAGG 


2001


ATCACTGTCC 


2051 


GGTCACTGGG 


2101 


TCTACTCaG 


2151 


TTTGITATCA 



FIG. 15-B 



1 GCCTCGCTCG GGCGCCCAGT GGTCCTGCCG CCTGGTCTCA CCTCGCCATG 

51 GITCGTCTGC CTCTGCAGTG CGTCCTCTGG GC-CTGCTTGC TGACCGCTGT 

101 CCATCCAGAA CCACCCACTG CATGCAGAGA AAAACAGTAC CTAATAAACA 

151 GTCAGTGCTG TTCTTTGTGC CAGCCAGGAC AGAAACTGGT GAGTGACTGC

201 ACAGAGTTCA CTGMACGGA ATGCCTTCCT TGCGGTGAAA GCGAAITCCT 

251 AGACACCTGG MCAGAGAGA CACACTGCCA CCAGCACAAA TACTGCGACC 

301 cCAACCTAGGGCTTCGGGTCCAGCAGAAGGOaCClCAGAAACAGACACC 

351 ATCTGCACCT GTGAAGAAGG CTGGaCTGT ACGAGTGAGG CCTGTGAGAG 

401 CTGTGTGCTG CACCGCTCAT GCTCGCCCGG CTTTGGGGTC AAGCAGAHG

451 CTACAGGGGT TTCTGATACC ATCTGCGAGC CCTGCCCAGT CGGCTTCnC 

501 TCCAATGTGT CATCTGCTTT CGAAAAATGT CACCCTTGGA CAAGCTGlGA 

551 GACCMAGAC CTGGTTGTGC AACAGGCAGGC ACAAACMGA CTGATGHGT 

601 CTGTGGTCCC CAGGATCGGC TGAGAGCCCT GGTGGTGATC CCCATCATCT 

651 TCGGGATCCT GTTTGCCATC CTCnGGTGC TGGTCTTTAT CAAAAAGGTG 

701 GCCAAGAAGC CAACCAATAA GGCCCCCCAC CCCAAGCAGG AACCCCAGGA 

751 GATCAATTTT CCCGACGATC 1TCCTGGCTC CAACACTGCT GCTCCAGTGC 

801 AGGAGACTTT ACATGGATGC CAACCGGTCA CCCAGGAGGA TGGCAAAGAG 

851 AGTCGCATCT CAGTGCAGGA GAGACAGTGA GGaGCACCC ACCCAGGAGT 

901 GTGGCCACGT GGGCAAACAG GCAGITGGCC AGAGAGCCTG GTGCTGCTGC

951 TGCAGGGGTG CAGGCAGAAG CGGGGAGCTA TGCCCAGTCA GTGCCAC-CCC 
CTC 



FIG. 16 



